Fast Similarity Search in Three-Dimensional Structure Databases
Authors
Abstract
Given a database D of three-dimensional (3D) molecular structures and a target molecule Q, the similarity search problem is to find the molecules O in D that match Q after allowing for an arbitrary number of whole-structure rotations and translations as well as a certain number of edit operations. The edit operations include relabeling an atom, deleting an atom, and inserting an atom. This search operation arises in many biochemical applications. In this paper we study the similarity search problem and a class of related queries. We present a computer-vision-based technique, called geometric hashing, for processing these queries. Experimental results on a database of 3D molecular structures obtained from the National Cancer Institute indicate that the presented technique performs well.
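As an illustration of the idea named in the abstract, the following is a minimal sketch of generic geometric hashing over labeled 3D points. It is not the paper's exact scheme: the function names (`frame_keys`, `build_table`, `vote`), the grid size `cell`, and the toy coordinates are all assumptions made for this example. Each ordered triple of non-collinear atoms defines a local coordinate frame, the remaining atoms are expressed in that frame and quantized, and the resulting keys are stored in a hash table. Because the frame moves with the molecule, the keys do not change under whole-structure rotation and translation.

```python
from collections import Counter, defaultdict
from itertools import permutations
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _unit(a, eps=1e-9):
    n = math.sqrt(_dot(a, a))
    return None if n < eps else (a[0] / n, a[1] / n, a[2] / n)

def frame_keys(atoms, basis, cell=0.5):
    """Quantized (label, x, y, z) keys of the non-basis atoms, expressed in the
    right-handed frame defined by an ordered triple of atom indices."""
    i, j, k = basis
    origin = atoms[i][1]
    x = _unit(_sub(atoms[j][1], origin))
    if x is None:                      # coincident basis points: no frame
        return []
    z = _unit(_cross(x, _sub(atoms[k][1], origin)))
    if z is None:                      # collinear basis points: no frame
        return []
    y = _cross(z, x)
    keys = []
    for idx, (label, pos) in enumerate(atoms):
        if idx in basis:
            continue
        d = _sub(pos, origin)
        keys.append((label,) + tuple(round(_dot(d, ax) / cell) for ax in (x, y, z)))
    return keys

def build_table(database, cell=0.5):
    """Preprocessing: hash every molecule under every ordered basis triple."""
    table = defaultdict(list)
    for mol_id, atoms in database.items():
        for basis in permutations(range(len(atoms)), 3):
            for key in frame_keys(atoms, basis, cell):
                table[key].append((mol_id, basis))
    return table

def vote(table, query, query_basis=(0, 1, 2), cell=0.5):
    """Query: one basis in the query votes for (molecule, basis) pairs."""
    votes = Counter()
    for key in frame_keys(query, query_basis, cell):
        for entry in table.get(key, ()):
            votes[entry] += 1
    return votes

def rotate_z_then_shift(p, angle, shift):
    """Rigid motion used only to build the toy query below."""
    c, s = math.cos(angle), math.sin(angle)
    return (c * p[0] - s * p[1] + shift[0],
            s * p[0] + c * p[1] + shift[1],
            p[2] + shift[2])

# Toy data (invented coordinates, not real molecules).
database = {
    "A": [("C", (0.0, 0.0, 0.0)), ("C", (1.5, 0.0, 0.0)), ("O", (0.0, 1.2, 0.0)),
          ("N", (0.4, 0.3, 1.1)), ("H", (1.0, 1.0, -0.6))],
    "B": [("C", (0.0, 0.0, 0.0)), ("O", (1.3, 0.2, 0.0)), ("S", (0.1, 1.6, 0.4)),
          ("C", (1.1, 1.2, 1.0))],
}
# The query is molecule A after a rotation and a translation.
query = [(label, rotate_z_then_shift(pos, 0.7, (3.0, -2.0, 5.0)))
         for label, pos in database["A"]]

table = build_table(database)
best, count = vote(table, query).most_common(1)[0]
print(best[0], count)
```

In this toy run, molecule A collects both available votes even though the query has been rotated and shifted. One way to accommodate the edit operations from the abstract (an assumption of this sketch, not a claim about the paper) is to accept candidates whose vote count falls short of the maximum by at most the allowed number of edits, since a relabeled, deleted, or inserted atom changes only a bounded number of keys. Points that lie close to a grid-cell boundary can fall into different bins under floating-point noise, so practical implementations usually probe neighboring cells as well.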
Similar resources
Ultrafast shape recognition for similarity search in molecular databases
Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its ma...
MLR-Index: An Index Structure for Fast and Scalable Similarity Search in High Dimensions
High-dimensional indexing has been widely used for performing similarity search over various data types such as multimedia (audio/image/video) databases, document collections, time-series data, sensor data and scientific databases. Because of the curse of dimensionality, well-known data structures like kd-tree, R-tree, and M-tree are known to suffer in their performance over...
Fast similarity search on video signatures
Video signatures are compact representations of video sequences designed for efficient similarity measurement. In this paper, we propose a feature extraction technique to support fast similarity search on large databases of video signatures. Our proposed technique transforms the high-dimensional video signatures into low-dimensional vectors where similarity search can be efficiently performed. ...
Utilization of Principle Axis Analysis for Fast Nearest Neighbor Searches in High-Dimensional Image Databases
This paper presents an efficient indexing method for similarity searches in high-dimensional image databases by principal axis analysis. Image databases often represent the image objects as high-dimensional feature vectors and access them via the feature vectors and a similarity measure. However, the performance of the existing nearest neighbor search methods is far from satisfactory for feature ve...
Fast indexing: a comparative evaluation
In this evaluation, the efficiency of three image signatures, called angular spectrum, Hough-based signature, and color histogram, is tested. The first signature is intrinsically hierarchical (deriving from the image frequency spectrum), so no signature space reduction technique is used. The second is a short signature that is directly indexed, and the last (color histogram) needs to be reduced for fast indexin...
Journal title:
- Journal of chemical information and computer sciences
Volume 40, Issue 2
Pages: -
Publication date: 2000